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ABS TRACT 


There is a growing need within the Navy for methods of detecting 
discrete narrowband signals in a non-stationary background. This paper 
concerns itself with the application of digital processing and spectral 
analysis techniques toward that goal. The use of the fast Fourier 
Transform in estimating the power spectrum of a signal is described. 

The method involves sectioning the time record, making "raw" estimates 

of the spectrum from these sections, and averaging these "raw" estimates. 
It is shown that more stable estimates are available if the segments are 
overlapped and an optimumamount of overlap for the case of the Hanning 
Window is found. It is shown that the stability of these spectral esti- 
mates can be interpreted as processing gain in the case of a discrete 
narrowband signal in additive noise. And finally, a brief description of 
signal detection theory applied to a human observer is presented to empha- 
size the flexibility that a human operator can bring to a signal detection 


system. 
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I. INTRODUCTION 


Today's ASW signal processing systems continue to grow in complexity 
and while many functions will be automated in systems now under develop- 
ment, the human operator will continue to play a key role in the proces- 
sing of data and the making of decisions. Man provides a flexibility to 
the system that is not available through complete automation and it is, 
therefore, imperative that he be considered an integral part of the 
system. 

In the past, assumptions have been made regarding the requirements 
that could be imposed upon the operator, without any precise knowledge 
of how an operator's performance actually would be influenced by the sus- 
tained demands of a — In general, it apparently has been assumed 
by system designers that the operator was like any of the hardware com- 
ponents that go into a system, i.e. that he would continue to perform at 
a level throughout a mission approximately equal to that at which he 
began. Common sense does not support this assumption, but to assume 
otherwise introduced difficult problems for which no useful answers 
were available. It seemed wise, or at least expedient, to let the 
characteristics of the electronics determine the direction of design and 
the development of procedures for the use of the system, with only casual 
consideration of the characteristics of the operator. However, studies 


of the role of the operator indicate that the rate of degradation of 


operator efficiency, and therefore system effectiveness, is a function 
of time. And it is much more rapid than even the most critical "user" 


or the most knowledgeable designer would have thought reasonable when 





the systems were much less complicated, and demands on the operator were 
much less drastic than at present. 

The quality of the operator's decision is based, to a great extent, 
on his insight and understanding of the problem. Thus, the primary objec- 
tive of a system which utilizes a human observer is to provide the oper- 
ator with the necessary insight and understanding of the problem to enable 
him to make the best decision possible. To do this, the display should 
utilize natural qualities of human perception so as not to drain the 
operator psychologically or expect him to make dicisions based on infor- 
mation he is not able to fully perceive. 

The quality of the operator's decision is also based on the accuracy 
with which the information provided him actually represents the physical 
situation. The purpose of the passive ASW processor is to detect the pre- 
sence of certain narrowband signals with random phase variation from a 
background of random ocean noise. It has been shown that the optimum 
receiver, in a maximum likelihood sense, for detection of such signals 
is an infinite set of Fourier filters followed by incoherent integration 
integration of power and a decision element (Williams, 1972). The pre- 
sent level of technology indicates that digital processors offer the high 
resolution, high dynamic range, repeatability, and flexibility necessary 
to effectively produce such a receiver. 

The purpose of this paper is to discuss the digital processing of 
acoustic signals with application to a new generation of passive ASW sign- 
al processors being developed today. Specific mention will be made of 
techniques in the "Spotlight' processor under development at the Naval 
Postgraduate School by Professor G.A. Rahe. 

General digital processing considerations are discussed. The statis- 


tical properties of the spectral estimate are presented as well as methods 





for improving them. The processing gain available from a system, which 
utilizes the FFT as a filtering device and incoherently integrates the 
power, is derived and finally a brief description of detection theory 


applied to a human observer in a sequential observation task is discussed. 








II. DATA ANALYSIS 


A. NOISE REPRESENTATION 

Ihe ambient acoustic wave fields in the ocean are often quite ener- 
getic and contain a wide range of frequencies. The study of these noise 
fields forms an important and necessary prelude to any serious design of 
detection systems. In the ocean, for example, one finds biological 
noise sources, noises due to wave action at the surface of the water, 
noises produced by meteorological conditions such as precipitation, and 
a great deal of shipping noise. For passive detection systems, these 
noise fields furnish the background out of which signals must be detected. 
From the experimental work of Parkins (1968) it seems reasonable to as- 
sume that the ocean noise field may, at least a large percentage of the 
time, be modeled as a weakly stationary, Gaussian random process. 

In order to understand the theoretical concepts of signal processing 
it is necessary to characterize the ambient noise field (and the desired 
signals) in commonly understood mathematical terms. The frequency domain 
(spectral) representation in which the "signal" is decomposed into a set 
of sinusoidal components as specific frequencies will be considered here. 


Select a particular sample function of noise and select from that 


sample an interval of duration L extending, say, from t = -L/2 to t = L/2. 


Such noise sample 5 9? (t) is shown in figure 2.1(a). Now generate a 
periodic waveform in which the waveform in the selected interval is re- 


peated every L seconds as in figure 2.1(b). This periodic signal 


Re) with period L can be represented as a sum of harmonically re- 


lated sinusoids (Fourier series) as follows: 





(s) 


pect) ې‎ ) A, cos(2mkf t - 4) (ze) 


k=1 
where A is a DC level; A and 9 are, respectively, the amplitude and 
phase of each sinusoid (line); and f (the fundamental frequency) 15 ۰ 
Plots of A, and by as functions of frequency are referred to as amplitude 


and phase spectra, respectively. AL and 9 are given by 


A = / a^ + b, Bro 2 324 (12202) 
and 
by = tan” (b, /a,) le 1 2 23er (2.3) 
where 12 2 
nz (s) E" 
a FT | n, (t) cos(2mkf t)dt |; Kk 1:2 NN (2.4) 
-L/2 
and L/2 
b, = = n SEULS sin(2nkf t)dt : k = 1,2,3 (2.3) 
k L t 0 , هوه ه ول وس و‎ . 
-L/2 


(a and by are Fourier coefficients). The DC level a, is obtained by 


(s) 


calculating the time average of بل‎ (t): 


1/2 
-i| a, 9) (oat (2.6) 
E 


a =A 
0 ٥ 


Several examples of amplitude and phase spectra of periodic signals are 
shown in figure 2.2. 

Although n: all periodic signals can be represented by Fourier har- 
monic — the class which can be is large. The limitations of the 
representation are discussed in Thomas and in Jenkins and Watts. Several 
other forms of Fourier series are also discussed in the literature. The 
form of eqn. 2.1 is a modified trigonometric version and is useful from 
an intuitive standpoint. Another form, which is more convenient for 


mathematical manipulations, is the exponential or complex version in 








Figure 2.1. (a) A sample neise waveform. (b) A periodic 
waveform is generated by repeating the interval in (2) 


from -L/2 te 1/2. 
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Figure 2.2. Amplitude and phase spectra of seme periodic 


Signals. 
11 





(s) 


which the signal n (t) is given by 


L 
(s) _ = Jm tst 
n f= Fee C205) 
L k 
k=-» 
where 1/2 
E (s) nett : ۸ 
C. L | nr (t)e dt لرل کل لاحم د‎ (2.8) 
-L/2 
The coefficients C. are complex and have the property that 
C =C 1 
وو‎ : (2:9) 


i.e. C and C are complex conjugates). These coefficients are related 


to the coefficients in equation 2.1 by 


Oca EN (2.10) 
Ak J Pk 
يون‎ e = 1/2(a + jb.) (291) 


Ihe exponential spectrum of the Fourier series produces a two-sided 
spectrum, each of half amplitude, one at frequency f and one at frequency 
-f. Although equation 2.7 indicates that the representation includes 
negative frequencies, this is not of physical significance because, in 
terms of this representation, the sum of the components at the positive 
and negative frequencies produces a real (in a mathematical sense) 
quantity. | 

Frequency domain representations of non-periodic signals require a 
generalization of the Fourier series approach. One might intuitively 
reason that L approaches infinity for a nonperiodic signal, which implies 
that the fundamental frequency (1/L) approaches zero. If this reasoning 
is carried out mathematically, it leads to a representation of the sig- 
nal as a continuum of frequencies rather than a "line" spectrum. The 


continuous representation (Fourier integral or Fourier transform) is 


12 





given by 


n(t) = | ode ar Qa) 
wnere = 
N(f) - | mu t dt (2:12) 


Ingeneral, N(f) is a complex quantity that can be represented by its 

amplitude and phase components as follows: 
N(E) = [N(E) [et ?? | (2.122) 

Plots of N(f) and $(f) as functions of frequency are called ampli- 
tude and phase spectra, respectively, and are analogous to the discrete 
A and OE Several examples of the amplitude and phase spectra of non- 
periodic signals are shown in figure 2.3. Note the similarity between 
the continuous spectra and the envelope of the line spectra in figure 
2. 

It should be noted, however, that not all time functions possess a 
Pourier transform. The condition for the existence of the Fourier trans- 
form is that the Fourier integral exist in the limit as Te, Existence 


is guaranteed by absolute convergence of the integral of equation 2.12, 


مه 


| Inc) |? at <=» (2.126) 


—00 
A number of functions having great usefulness do not meet this re- 

quirement; functions such as sine waves or step functions. These func- 

tions are nevertheless being handled by allowing Ihe more transform 

to contain impulses. In practice, the class of signals which are Fourier 

transformable is sufficiently large to encompass a wide variety of ap- 


plications. 
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Figure 2.3. Amplitude and phase spectra ef seme nonperiodic 


signals. 
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Fourier series and Fourier transforms of signals differ in their 
theoretical solutions attributable primarily to the fact that, for 
practical purposes, only finite length signals can actually be sampled 
and computed. The results of this computation are used only to provide 
estimates of the theoretical solutions. As seen in equation 2.7, the 
series method requires the sum of an infinite number of discrete sinu- 
soids and the integral method requires an infinitesimal frequency re- 
solution. Therefore, only approximations to these representations are 
obtained in practice. The discrete Fourier Transform (DFT), which will 
be discussed later, approximates a Fourier series by arbitrarily forcing 
the signal to be periodic and then truncating it.to a finite number of 


terms, 


B. DIGITAL SIGNAL PROCESSING 

During the decade of 1960-1970, an important new approach to wave- 
form manipulation, or signal processing, came into importance. It be- 
came practical to represent information-bearing waveforms digitally, that 
is, symbolically, and to do signal processing on the digital representa- 
tion of the waveform. Thus all the flexibility of digital computers 
became available for processing of waveforms, and this has had an effect 
on all fields in which waveform processing is important. It became ap- 
parent that one could manipulate waveforms digitally in ways that would 
be totally impractical with continuous representations. One advantage 
in this respect is the freedom from time constraints when a waveform is 
in a digital memory. Once a waveform is in a computer memory, it may 
be considered to have any desired time base associated with it. Asa 
result of the Fast Fourier Transform (FFT) algorithm, it became practical 


to do signal processing on waveforms in either the time or the frequency 
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domain, something never practical with continuous systems. The Fourier 
transform became not just a theoretical discription, but a tool; and 
real-time digital spectral analysis became a reality. Some of the 
practical considerations of performing this processing will be discussed 
in this section. 

1. Sampling Considerations 

Digital signal processing is based on the principle that an ana- 
log signal can be represented by the values of the signal at discrete 
points in time. This principle is embodied in the sampling theorem, 
which states that a band-limited signal is completely characterized by 
a set of samples taken at a minimum rate of 2f ax (called the Nyquist 
rate), where کو د‎ represents the highest frequency component in the 
signal. Although this statement of the sampling theorem appears straight- 
forward, there are several practical considerations which should be dis- 
cussed. 

One difficulty in applying the sampling theorem is that actual 
signals do not occupy well defined frequency bands, i.e., the spectra 
of real signals tend to gradually roll off until the amplitude of the 
spectral components reaches a minimum detectable level. Therefore, most 
sampling systems have a Ber just before the quantizing device to 
limit the signal to a predetermined band. This filter is called an anti- 
aliasing filter. 

When analog data are quantized i.e., converted from a continuous 
waveform to discrete values, at a rate of Í samples per second, the ana- 
log spectrum is divided into frequency bands of width 2 (figure 2.4a). 
During sampling, the energy in each band is folded or aliased into the 


lowest band (0 to £./2) and £ /? is called the folding frequency. The 
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folding operation can be envisioned if the bands in figure 2.4a are 
viewed from "above", as in figure 2.4b. The original analog spectrum is 
collapsed into the lowest frequency band, and the even-numbered bands are 
inverted in frequency as they are folded. Therefore, the net result of 
the sampling process is that, in spectra computed from the digital sam- 
ples, all energy in the original analog spectrum appears in the band 

0 to t /2 as seen in figure 2.4c. Figure 2.5a shows an example of a 
typical aliasing situation. It is obvious that the spectrum computed 
from the digital samples (figure 2.5b) provides very little information 
concerning the actual frequency content of the original signal because 
the original location of the computed spectral components is unknown. 
Only by containing the original spectrum in a single known band can 

the frequency content be unambiguously determined. 

Typically we think of two types of antialiasing filters, the 
low-pass filter and the band-pass filter. Both have application in 
digital signal processing and will be briefly discussed here. 

The low-pass anti-aliasing filter will be assumed to have a 
frequency response function as shown in figure 2.6. The amplitude level 


of the aliased components is given by 


K log, )(f£,-£,) - log, of, "T 
a. Bn 108, 02 . 


JPL POE GENE, cm gab (2.14) 


whe re f is the sampling frequency, f, is the filter cutoff frequency, and 


h 
M is the filter rolloff slope (db/octave). Equation 2.14 applies when 
the original analog signal spectrum is assumed to have components at 


frequencies that are above the folding frequency and equal in amplitude 


to components in the pass band. 
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Band 1 Band 2 Band 3 Band 4 





0 8 D E2 27; 


(a) Effect ef sampling the original analog spectrum 


2 B72 — 5f/72 772 





(b) Felding oporation viewed from "above", 


0 ft /2 


(c) Bandwidth ef ihe spectrum conputed fren the digital 


sanples, 


Figure 2.4. Aliasing effect of sampling en the Fourier 
spocirum. 
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Figure 2.5. ‘Typical antialiasing situation 
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" Figure 2.6. low-pass antialiasing filter 
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High-quality variable-cutoff commercially available filters 
generally have rolloff slopes from -12 to -48 db/octave. Sample fre- 
quencies usually are approximately three to four times the filter cut- 
off frequency and in order to provide a margin of safety from aliasing 
effects, an anti-aliasing filter slope of at least -36 db/octave and 
preferrably -48 db/octave are desirable. These figures apply to situa- 
tions in which a variety of general-purpose signal processing techniques 
will be applicable. Less conservative parameters can be substituted as 
the situation dictates. 

A bandpass signal of bandwidth W and center frequency be can be 
sampled at a rate greater than or equal to twice the bandwidth rather 
than twice the highest frequency component, if appropriate antialiasing 
filtering is performed. The bandpass antialiasing filter will be assumed 
to have the frequency response function shown in figure 2.8. Once the 
sample frequency has been chosen, the level of the components aliased 
from above the band of interest (assuming they are equal in amplitude 


to the passband components) is given by 
An = 3,32 M, log, o (X Den - 1) db (2715) 


where fu is the upper filter cutoff frequency, M is the filter slope 
(db/octave), - is the sample frequency, and K is the band number (see 


figure 2.7). The level of the components aliased from below the band 


of interest is given by 


Du con thes 7177 تت تت‎ - ٢ db | )2.16( 


where f. and M, are the lower filter cutoff and slope, respectively. 


2 
The procedure for choosing the sample frequency in bandpass sit- 


uations is not as straightforward as in the low-pass case. However, a 


fairly simple iterative procedure is as follows. 


a 21 


n 












signal spectrum 
Band 1 Band 2 Band/3 Band 4 


Figure 2.7. Sampling of a bandpass signal 
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Figure 2.8. Bandpass antialiasing filter 
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l. Define an effective signal bandwidth W' that is about two 
to four times the actual bandwidth of interest, depending on the steepness 
of the filter rolloff. 

2. Select a sampling band by choosing K to be the largest in- 
teger such that K « EW + 1/2. This will guarantee that the signal 
band W' will be encompassed by a sampling band. 


3. Define the sample frequency to be 


2f | 
= > ep )2.17( 
4. Calculate values for ALL and Aco: If these values are not 


acceptable, repeat steps 1 through 4 using a larger effective bandwidth W'. 
5. If it is inconvenient to generate the exact sample frequency 
Calculated in eq. 2.17, it may be adjusted as needed. Recalculation of 


A and A 
5 


sh will indicate if a significant amount of aliasing will result 


E 
from the adjustment. 

The effect of sampling properly filtered bandpass signals at a 
rate proportional to the bandwidth is the bandshift the signal spectrum 
to the baseband (figure 2.4c). This means that when spectra are computed 
with digital samples the frequency axis must be labeled in accordance 
with the location of the original signal spectrum. This location is a 
function of the sampling band number K and sample rate fo. For odd- 
numbered bands, the frequency axis should be labeled from (K-1)f /2 to 
kf /2 as shown in figure 2.9a. For even-numbered bands, the scale runs 
from Kf. to Kf /2 (figure 2.9b). Note that in the even-numbered bands, it 


is desirable to rotate the plot about the vertical axis and translate the 


frequency axis such that it runs from Kf /2 to Kf... 


29 
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Figure 2.9. Relabeling of frequency axis of computed 


spectra 
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2. Dynamic Range and Quantization Error 
The dynamic range of an analog-to-digital (A/D) converter can 
be defined as the ratio between the maximum and minimum peak signal 
values or as the signal-to-quantization noise ratio, depending on the 
usage of the digital samples. The maximum peak signal range in db 
(figure 2.10) is given by 


V 
max 


iem 
١ 


Ve 


6.02(n-1) (2. 19) 
where ER is the maximum input voltage level and n is the number of 
bits (including sign) of the bipolar A/D converter. For example, a 12- 
bit bipolar A/D converter has a peak signal range of about 66 db, which 
ES compatable with most analog instrumentation devices, e.g., tape re- 
corders. 

Quantization noise is caused by the stepwise approximation of 
the digital samples to the original continuous signal and can be treated 


as an additive noise component (figure 2.10). The RMS quantization 


noise is given by 


m max 
Ny = 0.29 jl 





(22295 


The RMS signal-to-quantization noise ratio (assuming a sine wave 


of peak amplitude UE in db is given by 


vao a 


SN. - 20 lo (2.21) 
Q 510 = 
‚29V 2 
max 
n-l 
= era 222 
20 1081۵ “Al ( ) 
= 20 log ر‎ (2 
10 

D+ 8 (2.24)‏ ح 
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Figure 2.10. Effects of quantizing an analog signal 
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Therefore, a 12-bit bipolar A/D converter will have a signal-to-quanti- 
zation noise ratio of about 74 db. 

Eqs. 2.19 and 2.24 both yield optimistic numbers, i.e., it is 
usually not possible to sufficiently control the amplitude of the input 
signal to the A/D converter so that the full voltage range is used. 

This means that the dynamic range can be reduced by 3 to 6 db, depending 
on the variability of the input signal. A buffer amplifier prior to 
the A/D converter is usually needed to condition the signal level. 

3. Discrete Fourier Transform (DFT) 

When analyzing signals whose parameters are known within broad 
limits, one of the first operations to be performed is some type of 
spectral analysis. This can be a single analysis, a series of analyses 
to indicate the time dependence of the spectral components, or an aver- 
aged spectral computation to approximate a power spectral density func- 
tion. This section discusses some of the considerations for using 
digital Fourier Transform techniques to analyze sampled signals. 

The DFT computes a finite Fourier series to represent a signal 
over a pre-defined time interval. The time interval L is chosen to 
include as much of the original signal as desired (figure 2.11a). The 
DFT, since it is a representation applicable to periodic signals, treats 
the signal as if it were periodic with period L (figure 2.11b). The 
discontinuities which may appear because of this forced periodicity are 
included in the representation and are of primary concern when performing 
DFT analyses. 

The DFT representation is closely analogous to the frequency 
domain representation for periodic signals (eqs. 2.1 through 2.6); how- 


ever, it is necessary to express the mathematical operations in terms of 
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(b) Assumed periodicity 
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Figure 2,11. DFT processing 
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sampled signal values. It will be assumed that the signal to be repre- 
sented or analyzed consists of a set of N equally spaced samples over 
the time interval L as follows: 

s(t,) = s(i-At) er 1, 2,...,N-1 (22255 


where At is the time between samples. The sample spacing is given by 


1 
= (2:526) 
S 


where - is the sample frequency. The representation of the original 
signal s(t) over the interval L is 
N-1 | 


s(t) = A. + d A cos(2mnAft + 0) O EET 2227) 


where 


Af = 1/L (2.28) 
the fundamental frequency or "bin" spacing. The amplitude and phase 


spectra are 


p M (2.29) 
n n n 
^ =] ^ ^ 
0 = -Tan | (b /a ) (2.30) 
n n n 
where ٨ : N-1 
a => )-s(t,) cosi(2amAft.) gon = 1,2,...,N-1 (2898) 
n N. 1 1 
i=1 Î ; 
» : N-1 
u > E eec o siu DEO EIE MO NET (2492) 
i-1 
The DC term A is 
^ ^ 1 AST 
a, = A, = N p s(ti) 
i-0 


The terms a, and b are commonly called the real and imaginary Fourier 


coefficients, respectively. 
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When the signal to be analyzed by DFT methods is real, the am- 
plitude and phase spectra have certain symmetrical properties. Mathe- 


matically, the symmetry is given by 


^ 


T E هلول‎ 2.34 
Aqj2)*-1/2) ^ ^quz)- 1/2) ? 17 b? N (2.34) 
which states that the amplitude spectrum has even symmetry about the 


point (N-1)/2. The phase spectrum exhibits odd symmetry about the 


point (N-1)/2. i.e. 


^ 


3 (8/2)- (1-1/2) - 9 (8/2) (1-1/2) POI wg eva 7 (2.35) 
In terms of frequency, the point (N-1)/2 can be expressed as £.12, the 
folding frequency. The portion of the computed spectrum between (N-1)/2 
and N-1 can be treated as the negative half of the spectrum which would 
normally occupy the band -(N-1)/2 to 0. For real signals, only the values 
between O and (N/2)-1 must be considered. 

Signal truncation introduces broadband leakage into the computed 
spectra, i.e. seemingly narrowband signals will appear to occupy more 
bandwidth than expected. Severe discontinuities in the pseudoperiodic 
signal may be present, depending on the relationship of the signal fre- 
quencies to the length of the period L (see figure 2.11). This can most 
easily be illustrated for the case of a sinusoidal signal. If the signal 
frequency E is such that 


CE — N دد‎ (2.36) 


(f is harmonically related to Af) the computed amplitude spectrum will 
consist of a single line component (figure 2.12a). If the signal fre- 
quency is offset by the amount Af/2, the spectrum shown in figure 2.12b 
is obtained. The apparent spectral width of this signal depends more on 


the truncation effects of the DFT calculation than on the actual signal 
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(a) Sine wave harmonically related to af 
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(b) Sine wave midway between two DFT frequency bins 


Figure 2.12. DFT amplitude spectra 
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bandwidth. The solution to this problem is to apply a "smoothing" 
window to the input data to eliminate or minimize the discontinuities. 
Windowing functions will be discussed in the next section. 

Although the subtleties of the DFT implementations are beyond 
the scope of this discussion, there are some implementation considera- 
tions with which the analyst must be acquainted. Until several years 
ago, spectrum analysis by digital techniques was an extremely time- 
consuming process, i.e., computation times proportional to n. where 
N is the number of points in the transform. 

Presently, most DFT calculations are performed with fast Fourier 
transform (FFT) algorithms which have computation times proportional to 
N log,N (N is assumed to be a power of 2). The FFT has made it possible 
to perform digital spectral analysis calculations in reasonable lengths 
of time, making on-line digital spectral analyzers a reality. However, 
at least in small computer systems, these algorithms are generally im- 
plemented in fixed point arithmetic which is subject to overflow. Thus, 
considerable care must be taken to properly scale the FFT computations, 
especially in situations where the signal energy falls into a small num- 
ber of frequency bins. 

. These overflows are particularly troublesome because they gen- 
erally appear as spurious components at unpredictable frequencies. The 
best way to prevent overflow while still maintaining the dynamic ampli- 
tude range of the calculation if to perform dynamic scaling on inter- 
mediate computed values in the algorithm. All scaling factors should 
be stored in a way that will permit the computed output values to be 
calibrated with respect to the known input voltages. Experience in- 


dicates that this type of scaling, along with appropriate windowing, 
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provides a dynamic range (ratio of the peak spectral component to the 
computational noise floor) of at least 80 db when implemented on a com- 
puter with 16-bit words, 
4. Window Functions 
In practice it is only possible to obtain finite lengths of 
records. The statistical questions to be considered later stem from the 
fact that it is necessary to estimate the accuracy of various functions 
obtained from finite amounts of data. From the preceding discussion on 
truncation effects and spectral leakage, it should be clear that the 
operation for taking a finite length record is equivalent to multiplying 
the actual signal s(t) by the data window w(t). The DFT actually uses 
samples from the function s'(t), which is given by 
sut) s Wu (E) TS WE) (2.37) 
where s(t) is the original signal and w(t) is zero everywhere but on the 


interval O to L. In the frequency domain, eq. 2.37 becomes 


s, (f) w(£)*s (£) (2.38) 


OO 


| S(g)W(E-g)dg |. (2.39) 


0 ے 


where * denotes convolution. This means the actual spectrum of s(t) 
is ١ 6 with the Fourier Transform of the window function w(t). 
Ihe DFT, in effect, samples this result at integral multiples of 1/L. 
Therefore, the shape of the window function's spectrum determines the 
degree of spectral spreading when a narrowband signal is truncated. 

Any reasonable data window w(t) will produce , spectral window 
W(f) which is concentrated about f=0 but with side lobes which damp out 
as f gets further away from zero, For small L, S (£) may give a very 


distorted picture of S(f) since the window W(f-g) will be wide and hence 
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values of S(g) far removed from g = f will contribute to S, (£) in the 
integral. As L becomes large the distortion will be reduced. Finally, 
as L tends to infinity, the transform component at frequency f can be 
fully determined, since data windows will tend to the generalized 
function ó as L tends to infinity. Hence, as L tends to infinity, W(f-g) 
tends to a delta function centered at g = f and so S, (£) tends to S(£). 
In reality, of course, this ideal situation is impossible since we are 
always dealing with finite-length records. 

Although there are many different window functions in the lit- 
erature, the advantages of windowing the data may be adequately observed 
by noting the differences between the rectangular window (i.e. simply 
truncating the data) and the Hanning (Cosine) window. It should be 
noted, however, that the application of any data window is simply a 
tradeoff between the bandwidth of the main lobe and the attenuation of 
the side lobes (i.e. a spectral window with a narrow main lobe will have 
as slower rate of attenuation of the side lobes than a window vith a 
wider main lobe). This comparison can be seen for the rectangular 
window and the Hanning window in the next two sections. 

a. Rectangular Window 

The rectangular window is applicable when samples of the 


original signal are used directly by the DFT. This function is given by 


l O CSTE SL 
W(t) = lo elsewhere (2130 
(see figure 2.13a). The Fourier transform of w(t) is 
Mm EL 77 oa 


TEL 
(see figure 2.13b and c). The actual location of the sin x/x function, 


and hence the amounts of energy leakage, in relation to the DFT frequency 
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(a) Time domain 
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(b) Frequency domain with input signal harmonically 


related to 1/L. 
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(c) Frequency domain with input signal half-way 


petween two bins. 


Figure 2.13. Rectangular window 
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bins, depends on the frequency of the input signal to which the window is 
applied. For an input signal frequency which is harmonically related to 
Af, the sin x/x function is sampled as shown in figure 2.13b and all sig- 
nal energy plus any other energy which might be present (noise) under the 
main and side lobes is reported as being present at the input signal fre- 
quency. For a signal frequency halfway between the two bins, the sin x/x 
function is sampled as shown in figure 2.13c and the signal energy is at- 
tenuated and spread throughout all the side-lobes giving a very distorted 
picture of the actual signal. The relative size of the side-lobe area 
compared with the main lobe area is very important when considering the 
amount of energy leakage. Figure 2.14 shows the spectrum of the rectan- 
gular-time window drawn to scale. 
b. Hanning Window 

The Hanning Window is adequate for general-purpose spectral 
analysis and is easily applied to data either in the time or frequency 
domain. : 


The Hanning Window is defined by 


1/20 * cos £75) — -L/2 « t < 1/2 


wit) = { (2.42) 
0 Elsewhere 


(see figure 2.15). The Fourier Transform of W(t) is 


W(£) Lim. R l sin(nfL-r) " 1 ern) (2.43) 


TIL 2 (nfL-r) 2 (nfL+r) 


_L sintfL 
2 د‎ 3191 0 


From equation 2.43 it is seen that the Hanning Window turns 
out to be the summation of three sin x/x functions. Since the main lobes 
are spaced 1/L Hz apart, this window has the effect of averaging three 


adjacent lobes, giving the center one twice as much weight as the side 
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Ses. Thus, there vill be a tendency to greatly decrease the size of 
the side lobes. The main lobe is reduced to one-half of its previous 
value and its width is doubled. The reduction in leakage, therefore, 
results in a corresponding widening of the bandwidth of analysis. The 
distance between the first zero crossing on either side of the main lobe 
is 2/L Hz. Figure 2.15 shows the spectrum of the Hanning window drawn 


to scale. 


Ge PROCESSING OF REAL DATA 

To see how some of the ideas presented thus far are applied to real 
data, consider the method of processing being used in the new "SPOTLIGHT" 
acoustic signal processor being developed by Prof. G. A. Rahe at the 
Naval Postgraduate School, Monterey. 

The idea behind the processor is to present the observer with a 
continually updated history of the spectrum of the incoming signal and 
to allow him to interpret the data appropriately. The display of the 
spectrum is presented as a three-dimensional random surface on a CRT 
screen as shown in figure 2.16. Each successive update of the display 
represents an estimate of the incoming signal's spectrum over a length 
of time T seconds وا‎ As each new estimate is presented, the oldest 
one is dropped off the back of the display and a new one appears on 
the front end. Each spectral estimate is formed as follows: 

1. A data window w of duration L seconds is applied successively 
to the available data in the overlapping intervals (0,L), (S,S+L),... 
[(P-1)S,T], where S is the amount of shift each adjacent data window 
undergoes, and P is the total number of pieces or segments employed 
(see figure 2.17). 

2. A DFT is performed on each of the P windowed sections using 


an FFT algorithm and a raw estimate is computed as the magnitude squared 
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tre 2.16. "SPOTLIGHT" processor display 






porc window w(t) 
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Figure 2,17. Overlapped data windows 
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3. The smoothed estimate is computed as the average of the P 
raw estiamtes. 


Since L seconds of data are used for each estimate we see that 


(P-1)S + L = T (2.44) 
Each estimate need not and most probably will not be independent of the 
previous estimate. The update rate of the estimates is a function of 
the operator's needs. The choice of the parameters L, T, and S is dis- 


cussed in the next section. 


D. PRACTICAL CONSIDERATIONS 

The effectiveness of the processor depends on its ability to make 
consistently stable spectral estimated using intelligently chosen para- 
maters. Some of the more practical aspects of the spectral estimate 
are now discussed. The parameters associated with making the estimates 
must necessarily be chosen prior to the processing of any data and should 
be made to meet certain specifications. 

The sampling rate must be chosen so that the spectrum can be esti- 


Bed in the frequency range of interest, Û < f < f Hence, the sam- 


ds 


pling rate must be at least 2f Care must be taken to avoid aliasing 


y 
as was discussed in section IIB. This is usually done with an anti- 

M asing filter prior to sampling the signal so that the power above 
fi the spectrum is effectively removed. 

Associated with reproducing the fine details of the spectrum is the 
fidelity of the estimate. To achieve high fidelity, the "Bandwidth" of 
the window used must be of the same order as the width of the narrowest 
important detail in the spectrum. It is, therefore, useful to be able 


to make an intelligent estimate as to the width of the smallest impor- 


tant detail required by the system to be "detected". For example, if 
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the width of the narrowest important detail in the spectrum is known to 
be .15 Hz, then as spectral window whose "bandwidth" is of the order of 
.15 Hz or smaller must be used to be able to resolve the fine details 
of the spectrum, Note that to resolve a spectrum to the same amount of 
detail, more data (a larger transform) would be required if using the 
Hanning vindow than if using the rectangular window since the "bandwidth" 
of the Hanning vindow is greater than that for the rectangular window 
(see figures 2.14 and 2.15). Assuming that the hardware is available, 
one should not be lulled into ignoring the above considerations and 
using the finest resolution available, because the requirements for high 
fidelity and high stability (to be discussed next) are conflicting and 
any practical realization is necessarily a compromise between the two. 
It seems appropriate at this time to say that the frequency "band- 
width" of a narrow band-pass filter (window) is described in several 
different ways. Three of the most common descriptions for bandwidth 
which are of interest to the power spectral density measurement are 
briefly described here. 


1. The half-power point bandwidth B is defined as 


B = f, ai (2.45) 


where 
2 2 
1١ ٢ 22717 - 


2 
luce | 


B is the frequency interval between the upper and lower frequencies 
where the filter attenuates an applied signal 3 db below maximum trans- 
missibility. This description of bandwidth is convenient because it is 
easy to measure, but is otherwise of little physical significance for 


random signal applications. 
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2. The noise bandwidth B is defined as 


| Img) | df 
m. — (2.46) 


n 


is the bandwidth of a hypothetical rectangular filter which would pass a 
signal with the same mean square value as the actual filter when the in- 
put is white noise. B is a convenient measure of bandwidth to use for 
normalizing narrowband mean square value measurements, as required for 
analog power spectral density measurements. 

3. The equivalent statistical bandwidth B defined as 


2 
Leia df 
A aec cae 


B = (2.47) 


S co 
1٢ df 


is the bandwidth of a hypothetical rectangular filter which would pass 

a signal with the same mean square value statistical error as the actual 
filter when the input is white noise. In other words, B is the descrip- 
tion of filter bandwidth which is technically appropriate when considering 
the statistical error € of a properly resolved power-spectral density 


measurement as given by 


(2.48) 


ar 


For the case of an ideal filter, the three descriptions of bandwidth 
are all equal. For any other filter characteristic, however, the band- 
width terms will generally be different. A comparison of these three 
definitions of bandwidth for the rectangular and Hanning data windows 


is seen below in table 2.1. 


43 





Rectangular 


Hanning 





Table 2.1. Comparison of Bandwidth Definitions 


For finite records, the extent to which the width of peaks can be 
estimated and fine detain detected is also influenced by the variance 
of the estimate. Hence, to be able to trust the fine structure of the 
spectrum, it must be possible to tie down the estimate to a given 
stability. The estimator is said to have high stability if the variance 
of the smoothed spectral estimate is small. A convenient description of 
a spectral estimate is given by its "equivalent number of degrees of 


freedom", K, defined (Blackman and Tukey, 1968) as 


= ss 


K : 
Variance 


(2249) 


Since a small variance is associated with a stable estimate, the larger 
the equivalent number of degrees of freedom the more stable the estimate. 
It will be shown in section III that maximum stability is obtained by 
1 in; the data windows as shown in figure 2.17. The amount of 
overlap used to achieve the maximum stability is dependent upon the par- 
ticular data window used. For a fixed transform length L, windows with 
lower sidelobes have wider and smoother main lobes and therefore require 
less overlap to achieve maximum stability. 

In addition to the fidelity and stability of the estimated spectrum, 
there are some other important considerations to be made when choosing 


the parameters to be used in making the estimates. For instance, the 
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statistics of the background must be a factor in determining the length 
of time, T, used in the smooth estimates. A particular advantage to 
this method of obtaining the spectral estimates is that L may be chosen 
so that slow variations in the background do not affect the performance 
of the processor. If necessary, the frequency resolution of the esti- 
mate may be adjusted by appending zeros to the data before taking the 
transforms. 

Additionally, if the desired signal is unstable - i.e., fading in 
and out due to the effects of the propagation medium, propagation paths, 
or just being transmitted intermittently - and therefore not present for 
at least lengths of time equal to T, then it is possible that the inte- 
gration process will actually attenuate the signal rather than enhance 
it.  Redundancy is the display will help alleviate the intermittent 
transmission problem by providing the operator with estimates which are 


not totally independent, but fade-in and fade-out problems due to propa- 


gation should be considered in choosing the integration time T. 
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III. THE SPECTRAL ESTIMATE 


In the development of modern communication theory, a vast body of 
knowledge involving various technical disciplines has been incorporated. 
The area of digital data analysis and measurement, for example, includes 
the widespread use of spectral analysis techniques made feasible by the 
availability of the fast Fourier transform algorithm. Increased emphasis 
on optimization and detection techniques also necessitates the use of 
modern statistical procedures. The very foundation of communication 
engineering includes such topics as probability theory, random processes, 
and information theory. This section will draw on what is necessary from 
the theoretical framework of modern communication theory to discuss the 
problem of estimating the power spectra of finite length records and 


determining their confidence level. 


A. STATIONARY STOCHASTIC PROCESSES AND ERGODICITY 

À process is termed deterministic if it does not contain any features 
of randomness; otherwise it is termed stochastic (or random). Spectral 
analysis is concerned mainly with a class of stochastic processes called 
stationary stochastic processes. A stationary stochastic process is one 
whose ical properties do not change with time. Signals generated 
by a stationary process have the property that a long enough segment of 
any signal recorded in some time period has essentially the same statis- 
tics as another segment of the signal observed at some other period. 
That is, a signal from a stationary process is not tied down to any par- 
ticular origin in time. 


There are several definitions of stationarity in current use. ۸ 
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process is said to be stationary in the strict sense if none of the 
statistics which characterize the stochastic process changes with time. 
Another definition of stationarity is wide-sense or weak stationarity. 
A process is said to be weakly stationary if the mean value and auto- 
correlation function do not vary with time. In other words, for the 
weakly stationary random process, the mean value is a constant and the 
auto-correlation function is dependent only upon the displacement T and 
not on the particular time chosen to compute the auto-correlation. 

If each signal of the ensemble of possible signals that can be gen- 
erated by a stationary process is typical of the ensemble as a whole, 
then the process is said to be ergodic. The statistics over a long time 
interval for any one signal are then the same as the statistics over the 
ensemble of all possible le at any one time instant. Thus an ergod- 
ic proce:s is one for which corresponding ensemble averages and time 
averages are equal. 

The class of ergodic processes is a subclass of the class of station- 
ary processes. That is, an ergodic process is necessarily stationary but 
a stationary process need not be ergodic. The application of theory to 
practice is very often simplified by making the assumption that a pro- 
cess is ergodic; however, except in special instances it would be diffi- 
cult or impossible to demonstrate that a physical process is ergodic 


other than by direct measurements and comparisons. 


B. DEFINITION 

The following discussion will examine the definition of the spectral 
estimate, the error involved in trying to make a spectral estimate from a 
finite length sample, and vays of improving this inherent error. To avoid 


confusion, the terms "spectral estimate" or "estimate" in this discussion 
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refer consistently to the estimate of each individual spectral component 
within the bandwidth of interest, B; and the term "spectrum" will be used 
to refer to all the individual estimates collectively. 

In order to proceed with estimating the spectrum of ocean ambient 
noise, some assumptions about the properties of the noise will have to be 
made. It will be assumed in this discussion that the ocean ambient noise 
may be considered approximately Gaussian and at least weakly stationary. 
Some recent research has shown that these assumptions are reasonable 
(Parkins, 1968). 

In the context of digital signal processing, the power spectral den- 
sity or power spectrum of a signal is defined as the magnitude squared of 
the Fourier transform of the signal. Inherent in the definition of the 
Fourier transform, however, is the fact that a signal of finite length 
will have a transform which contains all frequencies and, conversely, 

a band-limited spectrum emanates from a signal which extends from t = -œ 
tot= +», Therefore, in order to talk about a finite spectrum, the 


definition of the true power spectrum of a record n(t) is defined as 
-— 
G(f) - lim $e] [nce,7) |") 375 
T>» 


where N(f,T) is the finite Fourier transform of n(t), that is, 


| d 42 
N(f.T) =] n(t) e” dt G2 
0 


An estimate of the true spectrum given in Equation 3.1 for a sampie 
record of any length T may be obtained by simply omitting the limiting 


and expectation operators. This will yield an estimated spectrum of 


6(5,7) = = [nm] (3.3) 
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; l i : : 
where the frequency resolution will be m. Tnis estimate, as it stands, 
will be shown to be statistically unreliable, however. Methods of improv- 


ing the reliability will be presented in Section IIIE. 


See olLATISTICS OF THE SPECTRAL ESTIMATE 

As we discussed in Section II, N(f,T) is a complex quantity composed 
of the real and imaginary Fourier coefficients and Ixc£,m I? is the sum 
of the squares of the Fourier coefficients. Now, since n(t) is consider- 
ed to be a Gaussian random process with zero mean, and since the Fourier 
transform is a linear operation, it can easily be shown (see Section IV) 
that the Fourier coefficients are independent Gaussian random variables 
with zero mean and equal variances. 

Blackman and Tukey (Pg 21) show that each frequency estimate of G(f) 
is a chi-square variable with two "degrees of freedom". The number of 
degrees of freedom, K, may be thought of as representing the number of 
independent or "free" squares entering into the estimate. Note that Equa- 
tion 3.3 describes the estimated spectrum and that each spectral compon- 
ent is a chi-square variable iiit two degrees of freedom. The probability 


density function of us is given by 


oe Pay ¡2512 r(k/2)] T 1p 


(3:4) 
k 

where Ma) = ee e “dx (3) 

The mean and variance of the variable A are 

21 = 
k 
Varıy ?ı= 02 = Ne 27) 
[Xk J x2 
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Note that the variance of the spectral estimate is independent of the 
record length T. That is, increasing the record length does not reduce 
the variance of the estimate; it only increases the number of spectral 
components in the spectrum. Since the record length is a measure of the 
sample size for the estimate, this implies that the PSD estimates given 
by Equation 3.3 are inconsistent. To be consistent, an estimate should 
approach the parameter being estimated with a probability approaching 
unity as the sample size becomes large. That is, for any € >0, 

lim Prob ٠ - G| > e} =0 (3.8) 
To 

Because the estimates given by Equation 3.3 are inconsistent, the 
spectrum computed from that equation is called the “raw spectrum" and the 
components of the spectrum are called the "raw" components. It will be 


shown that more nearly consistent estimates can be made by "smoothing" 


the raw estimates. 


D. ERROR ASSOCIATED WITH THE SPECTRAL ESTIMATE 
The accuracy of parameter estimates based upon sample values is usu- 


ally described in terms of the mean square error defined by 
i 7 
Mean square error = E [(G - 6) ] (3.9) 


It can easily be shown that the mean square error is made up of two 
parts; the variance of the estimate (random error) and the square of the 


bias error of the estimate, that is, 
` A 2 ^ 
Mean square error = Var [G] + b [G] (3.10) 


An estimate is said to be unbiased if E[G] = G. Therefore, the bias 
error of the estimate is defined as 


Bias error = b [C] = E[G] - ۴ (3.11) 
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In Bendot and Piersol (Pg 187), the bias error for the spectral esti- 
mate is shown to be proportional to the second derivative of the true 
spectrum. Therefore, for a relatively smooth spectrum over the effective 
bandwidth B of each estimate, the bias error is usually negligible com- 
pared with the random portion of the error, VAR[GJ. The mean square 
error is thus approximately equal to the square of the standard error 


where the standard error is defined as 
Standard error = . VAR[G] = c[6] (3:12) 


If this error is expressed as a fraction of the quantity being esti- 


mated, it is called the normalized standard error, €» and is given by 


o [6] 


Ene ES (SES) 
Remembering that 
E [X,2]= E[G] = k = G (3.14) 
and 
VAR [x,2} = VAR [G] = 2k (3.15) 
we have 
o ve (3.16) 


And since K = 2 for the raw spectral estimate, we have a standard error 
of unity, which says that the standard deviation of the estimate is as 
large as the quantity being estimated — which is obviously unacceptable. 
The smoothing operations presented in the next section will increase the 
number of degrees of freedom associated with each estimate and thus de- 


crease the standard error considerably. 


E. SMOOTHING THE SPECTRAL ESTIMATE 


The method of reducing this error is called "smoothing" and may be 
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accomplished in one of two ways. The first way is to smooth over an 
ensemble of estimates. This can be done by computing individual esti- 
mates from q sample records and then averaging the q estimates at each 
frequency. The second way is to smooth over frequency. This can be done 
by averaging together the results of £ contiguous spectral components in 
the estimate from a single sample record. In either case, the smoothing 
operation approximates the a pon in equation 3.1 and 
hence makes the estimates consistent. 

For ensemble smoothing, the estimates from q independent (i.e., non- 
overlapping) time slices are averaged together. If each time slice is of 


length T', then the length of the original sample is T - qT' and the final 


smoothed estimate is given by 
1 
6 po t+G, , +... +6. _] (3.17) 


„s 


, ; toe 
where Gi is the raw estimate at frequency f of the q time slice. 
3 


Noting that As 1 = we see that the smoothed estimate will be a 


2 
*a+b 
chi-squared variable with roughly 2q degrees of freedom. The smoothing 
operation did not affect the bandwidth of the estimates; hence each esti- 


mate will have an effective bandwidth of B = ‚ where a depends on the 


= 
T 


particular window being used. Hence, by ensemble averaging 


er ra 
B d (3.18) 


E, noe JE | (3.19) 


The filter shape of each smooth estimate will be the same as for the 


and 


raw estimate which is determined by the Fourier transform of the original 


data window used. The estimate C is considered as representing the 
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midpoint of the frequency interval covered by the filter (B. is the effec- 
tive bandwidth of the filter). Figure 3.1 illustrates the process of ob- 
taining smooth spectral estimates using ensemble averaging. 

For frequency smoothing, the estimates of 2 contiguous spectral com- 
ponents from the raw spectrum are averaged to form the smooth estimate 
(the raw spectrum referred to was computed from a time record of length 
T). Hence, the ns spectral component G, of the smoothed spectrum will 
be given by 


T. 
کد«‎ us niic oie N 
2 2 


: „th a 
where G. is the i component of the raw estimate and it is assumed that 


& is odd. G, will be a chi-squared variable with roughly 2 degrees of 


freedom by the same reasoning used with ensemble smoothing. The effec- 
tive bandwidth, Bos of the smoothed estimate will depend on the value of 
2 and will be given by 


Nee 
B M GET NN (3.21) 


where the effective bandwidth of each raw estimate used in the average 
was B. - a/T. Again the value of a depends on the data window used. The 
filter shape of the smoothed estimate will be approximately trapezoidal 
as shown in figure 3.2. The process of obtaining a smooth estimate 
through frequency averaging is shown in figure 3.3. The estimate G, is 
considered as representing the midpoint of the frequency interval covered 
by Bo A total of N/£ independent estimates can be obtained in the 


smoothed spectrum. The standard error of the estimates obtained by fre- 


quency averaging is given by 
ya k Q 
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(c) Rav spectrum from segment two 
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(e) Smoothed spectrum by ensemble smoothing with q=3 


Figure 3.1. Example of ensemble smoothing 
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For combined smoothing using both frequency and ensemble averaging, 


the final effective bandwidth of each estimate will be approximately 


£ 
١ E 
p e % 


and the resulting G. will be a chi-squared variable with roughly 22%q 


degrees of freedom. The normalized standard error € is given by 


MP co 
2 f A (3.24) 


Even though the stability of the estimates may be improved by EE 
ensemble or frequency smoothing or a combination of the two, it is advis- 
able to use only ensemble smoothing in a system designed to detect narrow- 
band signals. The frequency resolution of the estimated spectrum will 
generally be determined to be of the same order of magnitude as the sig- 
nal to be detected, and taking longer transforms and frequency smoothing 


would be a very inefficient use of the available resources. 


F. INCREASED STABILITY THROUGH OVERLAPPING SEGMENTS 

The term "degrees of freedom" introduced in Section IIIC as a measure 
of the stability of the estimate was said to refer to the number of inde- 
pendent or "free" squares that were summed to form the estimate. A 
M ehtiy relaxed definition of the number of degrees of freedom of an 
estimate, called the "equivalent number of degrees of freedom", can be 
used by noting that for a chi-squared variable 


2E* [6] 


— Var[G] 


(3.25) 


This definition of K will be used to see if the stability of the estimates 
may be increased still further by overlapping the segments used in the 


ensemble smoothing operation. 
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(a) Filter shape before frequency smoothing 


final spectral window 
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(b) Filter shape after frequency smoothing 


Figure 3.2. Filter shape before and after freq.smoothing 
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n(t) 


(a) Sample record used for the spectral analysis : 


Gf) 
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3E 
(b) Raw spectrum from transform of n(t) between 


fr Oana’ tf = T 


Gf) 
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(c) Smoothed estimate by frequency smoothing with f=3 


Figure 3.3. Example of frequency smoothing 
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Referring to the notation of section 2.2.5 and figure 2.1/, we see 


that each smooth estimate is computed by 


Ñ TEEN. > 
Ber | (3.26) 
i=1 


where G, (£) is the finite Fourier Transform of the ith windowed section 


and P is the number of segments used. That is 


27+ (1- 5 
G, (£) = | SONS i ED oro sae (3.27) 
s+ (i-1)s 1<ic<p | 


The spectral estimate G(f) in equation 3.26 is a random variable. 


Its mean and variance were computed by Nuttall (1971) and are given by 


2 j 2 2 
E[G(f)] = | G(f - v) |w(v)| dv = G(f) | [WG) | dv (3228) 
E) 2 
Var[G(f)] = (E) E D (1 - Hb, as) (32225 
i--(P-1) 
where NT) = | W(t) W(t - T)dt (3230) 


e. 00 


under the assumption that n(t) is a Gaussian random process and that the 
Frequency resolution of the spectral window ۱ )۶( | is narrower than the 
finest detail in the true spectrum G(f). 

Equation 3.28 anos Chat the mean of the spectral estimate is equal 
to the convolution of the true spectrum G(f) with the spectral window 
wc |”. Equation 3.29 expresses the variance of the spectral estimate 
in terms of the number of pieces P, the shift S, and the autocorrelation 
$a of the data window. Using the results given in equations 3.28 and 


3.29, we can compute the equivalent number of degrees of freedom of our 
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estimate in terms of the number of pieces used and the autocorrelation 


of the data window as follows; 


2 
_ 2E_[&(£)] 


۹ 
221 Iw) l^ av] 
E (3. 32( 


P-1 
) (- Ehe (is) |" 


i--(P-1) 


But $,, C) - | W^ (t)dt (SAS) 


=- 00 


A. A A (3734) 
P-1 $,, (18) 
) (i 41 


i--(P-1) 








Notice that K is independent of the value of frequency f and true spec- 
trum G(f). It depends only on the particular data window W and the num- 
ber of pieces P. Noting that the shift S is uniquely determined as a 


function of P,L, and T, as shown in equation 2.44, we have 


E 2P 
a te Bb 7 
1+2-) ( “wi Pe | 
i=l m 


Í 
= 5 





It is interesting to note that for a given T/L, a maximum value of 
K occurs at some value of overlap less than 100 percent. The value of 
overlap is a function of the particular data window used and is relatively 
independent of the value of T/L. As described in section IIC, for 
P = T/L, K-» 2P. As P increases for a given T/L, the value of K will 
increase but at a continually decreasing rate because the overlapped 


pieces are progressively more statistically dependent. K will eventually 
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reach a maximum value and then decrease slightly as the amount of overlap 
approaches 100 percent. A typical curve of K versus percent overlap is 
shown in figure 3.4. Note that the amount of overlap required to achieve 
90 or 95 percent of maximum stability is considerably less than that re- 
Ensred for 100 percent. The extra computational effort required for 
highly overlapped processing should be given serious consideration before 
implementing the maximum value of K. 


The rectangular data window is described by 


W(t) » u(t) - u(t-L) 
where u(t) is a step function. 


The autocorrelation function as defined in equation 3.30 is computed 











to be 
eD FLS |r] ED (3.36) 
and m 
p(T) _ T 
$0y ^ 1771 ND (3.37) 
For the Hanning window 
1 E 
W(t) = zQ = COS =. [u(t) - u(t - L)] (3536) 
thus | o(t) = (1-1) (1+ eos tI sin“ (3.39) 
APD 
OIN لن در‎ EL EAE 1 7 
and 6(0) ^ 3r i 1) (14 ¿Cos ) + 5, sin (3.40) 
qs] zn 


Tables 3.1 - 3.4 show how K varies as a function of P for several 
values of T/L, as given by equation 3.35, for the rectangular and Hanning 


windows. 
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Figure 3.4. Typical K vs S curve 


61 





TABLE 


PF NOU ON DEDO O 10 O ال هق‎ 


3 


62 


RECTANGULAR 


LS K ys PEOR WL = 4 


HANNING 





S (%) 


TABLE 3 


oO 
Up 
Qv M29 (GVO - CO ON I9 -J O IS r9 O0 I9 GO O N YY FF O IM uA 


RECTANGULAR 


.2 K vs P FOR T/L = 8 


63 


HANNING 





S (%) 


TABLE 3.3 
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K vs P FOR T/L = 16 
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HANNING 


RECTANGULAR HANNING 


P S (%4) K K 
32 0 64.0 64.0 
33 3.1 65.88 66.0 
34 6.1 67.52 68.0 
35 8.8 68.96 70.0 
36 TIT 70.22 72.0 
B 13.9 032 74.0 
38 16.2 72.30 76.0 
39 18.4 73.16 78.0 
40 20.5 73.93 80.0 
50 36.7 79.08 99.61 
60 75 83.16 115.85 
70 Sl 86.53 125.43 
80 60.8 87.94 129.31 
90 65.2 89.11 130.36 

100 68.7 90.29 130.52 
101 69.0 90.37 130.53 
102 69.3 90.43 130.53 
103 69.6 90.49 130.52 
104 69.9 90.55 150952 
105 Ome 90.61 130. 52 
110 71.6 90.84 130.50 
120 74.2 91.29 130.45 
130 76.0 91.82 130.40 
140 E 92.08 130.35 
150 . 42 92.29 130.30 
200 84.4 93.07 130.11 
.300 89.6 93.55 129.92 
400 92.2 93.70 129.82 
. 500 ! 93.8 93.75 129.76 


TABLE 3.4 K vs P FOR T/L = 32 
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ee CONFIDENCE INTERVAL FOR THE SPECTRAL ESTIMATE 


Since the sampling distribution of the smoothed estimate is approxi- 
mately chi-square with K degrees of freedom, we have a (1 - a) confidence 
interval for a power spectral density function G(f) (based upon an esti- 


mate G(f) measured with a resolution bandwidth B and a record length T) 


given by 
JA < G(f) < ~ KEM) (3.41) 
XK: a/2 Xk: 1 - a/2 


The confidence statement is interpreted as follows: The true PSD G(f) 
fete fall within the noted interval with a confidence coefficient of 
(1 - a), or in more common terminology, with a confidence of 100(1 - o) 


percent. 
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IV. PROCESSING GAIN OF A FOURIER PROCESSOR WITH INTEGRATION 


A. INTRODUCTION 

Section III discussed the spectral estimate by the use of the fast 
Fourier transform technique and the method of improving the estimates for 
a given record of finite length. It was shown that the measure of 
quality or reliability of the estimate is the mean square error (or vari- 
ance) and that this error can be made arbitrarily small if a longer 
record were available. This situation often creates a three-way com- 
promise among effort, resolution and stability (precision) of the 
estimate. 

This section will interpret the stability of the spectral estimates 
in terms of processing gain within the context of a Fourier signal pro- 
cessor. Figure 4.1 shows a model of the system. Its mission is to de- 
tect the presence in a random noise background of discrete narrowband 
signals with random phase variation. The decision element of the syetem 
is assumed to be a human pee monitoring a display similar to the 
Beewdiscussed in section JJ c. The sensitivity of the processor is us- 
ually defined in terms of. a minimum detectable signal for a given pro- 
bability of detection and false alarm. Because of the presence of the 
human operator in the system, the level of minimum detectable signal is 
difficult to determine and will not be discussed. A brief look at the 
theory of signal detection by a human operator will be coverd in section 
V. It should be noted that the "processing gain" discussed in this sec- 
tion is derived under rather idealized conditions. The effects of 
redundency and Hanning weighting are considered but losses due to hard- 


ware limitations are not discussed. Losses due to actual implementation 
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DETECTOR 
(OPERATOR) 








Figure 4.1. Fourier processor with integration 
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have been found to be small if careful consideration is given to the 


areas discussed in section II. 


B. PROCESSING GAIN DEFINED 

When talking about processing gain of a Fourier signal processor, it 
is convenient to view the DFT as a bank of N independent narrow band 
filters, where N is the number of equally spaced samples (each of width A) 
within the time of L the data to be transformed. The center of each 
"filter" will be located at multiples of the fundamental frequency 
f = = Hz and the shape of each filter will depend upon the data window 
used as previously discussed. Regardless of which window is used, however, 
maximum power transfer will occur only if the desired signal if a multiple 
of the fundamental frequency. If the Signal is not an even multiple of 
Eos then it will be attenuated and its energy spread throughout the 
spectrum (the extent of the leakage will again depend on which window 
is used) as shown in figure 2.12. 

The processing gain (PG) of a digital processor which utilizes the 
DFT as the initial filtering um and then incoherently integrates 
the output of each filter (i.e., ensemble smoothing) will be made up of 
two distinct parts, one part due to the DFT itself (PG,.) and one part due 
to the incoherent integration (PG,). Heuristically, this makes sense, 
since the signal-to-noise power ratio (SNR) will obviously be improved 
by passing the signal through a narrowband filter. And then the results 
can be improved even more by averaging the random noise power over time 
since the average signal power is approximately ME Therefore, 


the processing gain for the system is defined as 


= . 4.1 
PG PG, PG, (4.1) 
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For simplicity, the processing gain will be derived in terms of a 
single discrete frequency component in an additive white gaussian noise 
background. The assumption of a white noise background is really not 
very restrictive since all that is required is that the power spectrum 
of the noise be flat over the bandwidth of each filter. Further, it 
will be assumed that the signal to be "detected" will be an integer 
multiple of the fundamental frequency, Los 1/L, so that maximum power 
transfer is achieved by the filter. This is a reasonable assumption 
since we are interested in the maximum gain available from the system. 
Also, it will be assumed that the signal is stable in frequency through- 
out the interval of observation and that it is actually present in the 
background during that time. Exceptions to the above assumptions will 
be discussed later. 

The processing gain of each section of the processor will be de- 
fined as the ratio of the signal-to-noise power at the output of the 
section to the signal-to-noise power at the input of the section. For 
Standardization, the noise power used is usually the average power of 
the noise in a 1 Hz band. Thus for the u filter of the DFT (see 
figure 4.2), 

SNR 


u = SNR. (4.2) 


SNR, 
a; = SNR, (4.3) 


i : : Eee 
Notice that the signal-to noise ratio at the output of the j filter 


is the input signal-to-noise ratio to the integrator (i.e., SNR, = SNR,). 


Deore, SNR, SNR, SNR, 
۰ = ېه سا يه امس م ل‎ A 
EE n SNR, ' SNR, SNR, ur Gr 


the processing gain for the entire systen. 
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INPUT l | OUTPUT 


Figure 4.2. Reference for signal-to-noise ratios 
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For purposes of the following analysis, only the processing gain of 
a typical filter will be derived. Results for the remaining N-1 filters 
of the processor may be generalized from this one by considering whether 


or not a signal is present within the bandwidth of the filter. 


C. PROCESSING GAIN OF THE DFT (PG, 3? 


The En filter output of the DFT is defined as 


N- as 
3 DE. eem (4.5) 
J k-0 
where s k = 0,1,2,...,N-1 are the N sample values of the input signal 


Ie? 


to be transformed. 
; Eliae. 
With noise only present at the input, the output of the j filter 


may be written as 


N-1 ae 
C a n E LKN (4.6) 
j "ug 5 
where n,, k = 0,1,2,...,N-1 are sample values from a zero mean Gaussian 


17 
random process. In terms of the real and imaginary coefficients, the 


complex quantity ec can be written as 


c. ca Fib. 41 
J J J ( 
where ~>- s N-1 
a, = Re(c,) = = ) n COS (2T jK/N) (4.8a) 
j j Nw k 
k=0 
-1 N-1 
b, = In(e,) SES D n SED (4. 8b) 


Now since the DFT is a linear operator, the random variables a and o 
are each zero-mean Gaussian random variables when the input signal is 
noise alone. To show that a, and 8 are also independent, we need only 


to show that E[a,h.] = 0. 


(2 





Now 


10) 


11) 


12) 


13) 


1 Y 2 Ww 
115 5 SED ny cos جوا‎ i, 4510 (49) 
D N zo E O 
1 N-1 N-1 
HE ) Y E[n, n,] cos(2rjk/N) sin(27j£/N) (4. 
N k=0 2=0 
1 N-1 2 
20S ) E[n, ] cos 2rjk/N) sin(2rjk/N) (4. 
N k=0 
c? N-1 
u RA. (4. 
N^ k-0 | 
E[a;b. ] = 0 (4. 


2.2; 
where o is the variance (average power) of the input noise signal. 


Therefore, 2 and g are independent zero-mean Gaussian random variables. 


They also have equal variances as shown below: 


2 1 N-1 1 N-1 
Var[a,] = E[a, ^] - E[G; ) cos 2-mjk/N) C- ) n, eos 27j4/N)] (A. 
j j N =0 k N 1-0 x 
1 N-1 N-1 
= EZ, ) ) nn, cos (27jk/N) cos (21j%/N)] (4: 
N k=-0 £20 
N-1 
= i ) Eln ^] C OE) Cae 
N k=0 
2 N-1 
Es ) 1 + cos(4rjk/N) (4. 
AN k=0 
c * 
= ON (4 
And 
) 1 N-1 1 N-1 
Var[b,] » E[b, ] » E[G; ) n, sin 2mjk/N) G ) n, sin 27j2/N)] (a. 
j j N k-0 k N 2-0 Y 


718 


14) 


15) 


16) 


17) 


.18) 


19) 





N-1 


2 2 

= 2, ? Ein, ] sin (21jk/N) (4.20) 
N k=0 
q? N-1 

Ze, ) 1- cos (2mjk/N) (4.21) 
2N k=0 

= o^ (12225 
2N 


th ; 
Now the power spectral component of the j filter is defined as 


G, = aha T 1 ! (4023) 
J J J | 


and the average power in that bin is therefore given by 


DE EU NN ^ vemm (4.24) 
J J J 
y? 
S (4:255 
Now if a signal is present, the sample values S, SO LL Nl 


are each the sum of a zero mean Gaussian random variable and a constant 
REN سر‎ : 
A cos (271fkA + 9). So the j filter output can be written as 


N-1 


C, - Ii J [n *'^cos QmfkA + Q)]e N (4.26) 
i Bee 
k=0 
N-1 peT N-1 m 
Ci. د‎ ٢ ) n e pe ) A cos (21fkA + $)e MOST (42279 
j N =0 k N «=0 


which can be seen to be the sum of two independent parts, one due only to 
the noise and one due only to the signal. The noise portion was handled 


above. Consider now only the portion of the output due to the signal. 


Then, 
N-1 7 
C aye J cos (2mfkA * en lei (4.28) 
j,S Nee 
k=0 
Ne OnE Ee (ata ida RN (O 
== ) le +e je 
SE 
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Now under the assumption that the signal is in the center of the bin we 


have fA = j/N. So 


N-1 . . et 
کر = : ن0‎ ) T + AE UE (4.30) 
8 k=0 


ER 
The total output of the j bin with signal plus noise at the input is 


thus given by 


N-1 
m^ 17 
C. = an += ) y e trans (4.32) 
J k=0 
N-1 
=- (Ê i N-1 
> Sn i eier ve nee) 
k=0 2 N k 
k=0 
)4.33( 
The power spectral component at frequency j/N is given by 
^ 2 
G. = |c.| (4.34) 


J J 


2 N-1 N-1 N-1 


c, = 7 + s cos t D m, Qnin += sin $ Lom. sin(21jk/N) + 2 lw 
(4.35) 
and the average power at that frequency is given by 
Avg Pwr - ELG] (4.36) 
: a + s (4.37) 


: | cn et 
since EIN, J = 0. Notice that the average power in the j bin is made up 
of two components, one due only to the noise and one due only to the sig- 
nal. This is not an unexpected result since the noise and the signal were 


uncorrelated and therefore, the power should be additive. 
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Referring to equation 4.2, we are now ready to compute PG, D The 


> 


Signal-to-noise ratio of the input signal is given by 


No 


A 
SNR, = 





(4.38) 


و 


20 
NR : : 2 ; 
where o is the average power (variance) of the noise and A /2 is the 


RMS signal power. From 4.37 it is obvious that 


2 
SNR, = EN -N our (4.39) 
2 4 2 2 1 . 
o f 
SNR, N 
therefore Fon; = SNR, = 7 (4.40) 


which is a realistic result since the original noise power has been ef- 
fectively separated into N/2 bins (see symmetry conditions of the DFT 
in section IIB) and the average sigan] power is totally contained within 


h 
the j ` bin. 


D. PROCESSING GAIN OF THE INTEGRATOR AND TOTAL SYSTEM GAIN 
The processing gain due to incoherent integration in the gel bin is 
due entirely to the variance reduction (or smoothing) of the noise. This 


may be seen as follows: 


. 2 
SNR, = SNR. = ه٨‎ (4.41) 
3 2 k 2 IN 
i 3 
A JA 
and SNR, = CENE ) 
o, /N 


2 2 : Oop : 
where 0, and g, are the variances in the j bin due to the noise at 


stages three and four, respectively, of figure 4.2. Referring to Equation 


4.3 we see that 


= em (4.43) 
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= =y yE nu 





The variance of the smoothed spectral estimate was computed by 


Nuttall (1971) and is given by 





A 2 (P-1) 
G 
Var[G] - ux (1 - Ele, (as) |" (4.44) 
== 10) 
P-1 
G^ f i ; 
- zs (s (0) |? +2) (1 - =) | Gs) | (4.45) 
i=] 
where G(f) = true spectrum of noise 
$ (T) = autocorrelation function of the data window 
P = number of raw transforms averaged to obtain 


the estimate 


S = amount of overlap used in the P transforms in 
seconds. 


The reader is referred to figure 2.17 and section III F for a review of 
how the above parameters relate. 
Now, for the raw spectral estimate (i.e., estimate available at stage 


3 of figure 4.2) P = 1 and we have no overlap. Therefore 


! 2 
0,7 = 6 (£)|4, (0) | (4.46) 


For the smooth spectral estimate (i.e., estimate at the output of 


the integrator), P»»»1 and we have 





2 P-1 
2. G (f) i ME 
o C (el t2 20 2 5) 1¢ (48) | (4.47) 
Therefore PG, , = A en. (4.48) 
? l+2 (1- الا‎ 
i=] E P 
K 

eim (4.49) 
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where K is the equivalent number of degrees of freedom of the estimate 
(see section 3). Note that if P = T/L, there is no overlap and PG, 47 P. 
9 


For P > T/L, the processing gain will be greater than P. Combining 


equation 4.49 and equation 4.40, the system processing gain is given by 


PG = (4.50) 


په 


To maximize the system processing gain for a given T/L, we want to 
overlap the transforms so as to maximize the value of K. Figure 4.3 
shows that رک‎ is linear as a function of T/L if the Hanning window: 
is used on the data. Similar results could be obtained for any data 
window using equation 3.35. Figure 4.4 shows the amount of overlap 
required to obtain K  , .99K anid, , 95K as a function of T/L. 

max max max 

Note that while the amount of overlap required to obtain 5 ases increases 

slightly with T/L, the overlap required to obtain .99K or 285k 
max max 

is nearly independent of T/L. The slight discontinuities in figure 4.4 

at small values of T/L are due to the large percentage overlap changes 

with the addition of one piece (or transform). Thus, for the Hanning 

window it would seem that 60 % overlap would be the optimum overlap 

from a processing gain point of view. Figure 4.5 shows how the value of 

K varies as a function of T/L if other than optimum overlap is used. 


From figure 4.3 we see that 


K 
max 


4.15 =) - 2.20 (4.51) 


So i 
96 2 1) 0 T/L = .55) (452) 


or in decibels, 


PG = 10 log, oN #10 168 5ل‎ ML = 255) db (453) 
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Figure 4.4. 
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Figure 4.5. EDF vs percent overlap for Hanning window 
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Usually one would expect N>>T/L, so that the first term of equation 
will be the most dominant factor in figuring the processing gain. For 


example, if N = 8192 and T/L = 16, 


EG = See 295 = 51.18 db (4.54) 


E. NON-IDEAL CONDITIONS 

In the above discussion of processing gain, it was assumed that the 
signal frequency remained constant over the entire period of observation. 
If this is not the case, then both PG, and PG, will be less than ideal. 
For example, in the case of the transform, assume that the input fre- 
quency is f = j/L for the first N/2 samples and then abruptly jumps to 
f = (jt+l)/L for the remaining N/2 samples. Then, the processing gain 
(PG,,) for both the ٣۳ and Ge filters is N/8, or a loss of 6 db 
over the ideal case, 

Next assume that the frequency is a constant f = j/L for t seconds 
of the total T seconds of integration, and then abruptly jumps to f = 
(jt1)/L for the remaining T-t seconds. Then the average power in the 
pen bin will be attenuated by a factor of t/T and thus PG, for the qe 
bin will be attenuated by a factor of t/T (i.e., a loss of 10 logjg(t/T) 
db) from the ideal case. The loss from the ideal case for the ٢ 
bin will be 10 108,0 (T-t)/T db. Thus we see that if the integration 
time is significantly larger than the time the signal spends in the bin, 
then we can quickly eliminate any gain we might have obtained through 
the DFT. For example, if T = 256 and t = 80, the loss in the aa bin 
amounts to 15 db over the ideal case. Careful condideration should 
therefore be given to the integration time and frequency resolution to 
to use for the particular problem at hand. Very fine resolution and a 


long integration time will not aid in detection of a signal which is 


unstable in frequency and e in and out besides. 





V. SIGNAL DETECTION AND THE HUMAN OPERATOR 


A. INTRODUCTION 

In the mid-1950's the theory of statistical decision, motivated by 
problems in radar, was translated into a theory of signal detection. 
Although the theory specified an ideal process, the generality of the 
theory suggested that it might also be relevant to the detection of sig- 
nals by human observers. The relevant theory seemed to provide a frame- 
work for a realistic description of the behavior of the human observer 
in a variety of perceptual tasks. A brief overview of the general the- 
ory and a description of certain elements of the signal detection theory 
appropriate to the human observer are included here to point out the 
flexibility that a human operator can provide in a signal detection 
System. 

Statistical decision theory specifies the optimal behavior in a 
situation where one must choose between two alternative statistical 
hypotheses on the basis of an observed event or series of events. In 
particular, it specifies the optimal cutoff, along the continuum on 
which the observed Cris arranged, as a function of (a) the a 
priori probabilities of the two hypotheses, (b) the values and costs 
atea with the various decision outcomes, and (c) the amount of 
overlap of the distributions that constitute the hypotheses. In short, 
the detection problem is a problem in ME decision; it requires 


testing statistical hypotheses. 


B. FUNDAMENTAL DETECTION PROBLEM 
In the fundamental detection problem, an observation is made of 


events occurring in a fixed interval of time, and a decision is made, 
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based on this observation, whether the interval contained only the back- 
ground interference or a signal as well. The interference, which is 
random, is referred to as noise and is denoted by N; the other alterna- 
tive is termed signal plus noise and is denoted by SN. In the funda- 
mental problem, only these two alternatives exist — noise is always pre- 
sent, whereas the signal may or may not be present during a specified 
observation interval. Actually, the observer, who has advance knowledge 
of the ensemble of signals to be presented, says either "yes, a signal 
was present" or "no, a signal was not present" following each observa- 
tion. The fundamental problem will later be extended to include a third 
alternative - "I'm not sure, continue", but for now consider only the 


fundamental case. 


C. REPRESENTATION OF SENSORY INFORMATION 

Any observation may arise, with specific probabilities, either from 
noise alone or from signal plus noise. This is shown graphically in 
figure 5.1. The observation is labeled x and plotted on the abscissa. 
The left-hand distribution, flia د‎ represents the probability 
density that x will result given the occurrence of noise alone. The 
right-hand distribution, Eo ¥)» is the probability density function of 
x en the occurrence of signal plus noise. Since the observation will 
tend to be of greater magnitude when a signal is presented, the mean of 
the SN distribution will be greater than the mean of the N distribution. 
In general, the greater the amplitude of "" signal, the greater will 


be the separation of these means. 


D. OBSERVATION AS A VALUE OF A LIKELIHOOD RATIO 
In the discussion thus far it has been assumed that the observation 


may be represented along a single axis. That this is a valid assump- 
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Probability Density 


Observation, X 


Figure 5.1. The probability density function of noise 


and signal plus noise. 
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tion — in spite of the fact that the response of the visual system pro- 
bably has many dimensions - is explained in terms of some concepts that 
are basic to the theory. 

Assume that the response of the sensory system has several dimen- 
sions, and represent it as a point in an m-dimensional space. Call this 
point y. For each such point in this space there is some probability 
density that it resulted from noise alone, f (¥): and similarly some 
probability density that it was due to signal plus noise, foy) 
Therefore, there exists a likelihood ratio for each point in the space, 
A(y) - Eq 0/0), expressing the likelihood that the point y arose 
from SN relative to the likelihood that it arose from N. Since any 
point in the space, i.e. any sensory datum, may be thus represented as a 
real, non-zero number, these points may be considered to lie along a 
single axis. The observation x is thus identified with A(y) and the 
decision axis becomes the likelihood ratio. 

The assumption of a unidimensional decision axis is independent of 
the characteristics of the signal and noise. Rather, it depends on the 
fact that just two decision alternatives are considered. More generally, 
it can be shown that the number of dimensions required to represent the 
observation is M-l, where M is the number of decision alternatives con- 
Sidered by the ‘observer. 

Having established that the observation x may be identified with the 
Likelihood ratio A(y), it can also be shown-that we may equally well 
identify x with any monotonic transformation of A(y); i.e. nothing is 
lost by distorting the linear continuum as long as order is maintained. 
ii partreular, if it is assumed that a transformation of A(y) results in 
Gaussian density functions on x, the problem will be greatly simplified 


as will be seen shortly. The existence of such a transformation was 
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assumed in the representation of the density functions, foy £) and 

f Gx) in figure 5.1. A further assumption is that the SN function is 
simply a translation of the N function; i.e., that the two density 
functions have equal variances. This has been found to be not strictly 
true, but for the purpose of this discussion the small difference in the 
variances will not be relevant. 

To summarize the last few paragraphs, it has been shown that an 
observation may be characterized by a value of a likelihood ratio, 
A(y), i.e., the likelihood that the response of the sensory system y 
arose from SN relative to the likelihood that it arose from N. This 
permits viewing the observations as lying along a single axis. It was 
then assumed that a particular transformation of A(y) would result in 
normal probability density functions on x. The observer is assumed to 


nase his decisions on the variable x. 


E. DEFINITION OF THE CRITERION 

From the definition of the problem we see that the observer attemp- 
ting to detect signals in noise is faced with two alternate hypotheses. 
He must decide from which hypothesis the observation resulted, N or SN. 
The observer must establish a policy that defines the circumstances 
under which the observation will be regarded as resulting from each of 
the two possible events. He will establish a criterion, a cutoff x, On 
the continuum of observations, to which he can relate any given observa- 
tion X. If he finds for the n observation, X; that x; > x he says 
mons of Xi < X he says "N'. 

The observer's decision will be one of four outcomes: he will say 
"SN" or "N" and will in either case be right or wrong. The decision 


purcome. in other words, will be a "hit", a "miss", a "correct rejec- 
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tion", or a "false alarm". Clearly, these four possibilities are inter- 
dependent. For example, an increase in the probability of a hit can be 
achieved only by accepting an increase in the probability of a false 
alarm and decrease in the other probabilities. Thus, the balance de- 
sired by an observer in any instance will determine the optimal location 
SI his criterion, x The relationship of these four probabilities to 
one another is shown in figure 5.2, where the probability of a "hit" is 


represented by region f TA the probability of a "miss" by f "up 


SN SN 


(or B), the probability of a 'correct rejection" by f, * B, and the pro- 


N 
bability of a "false alarm" by A ° A (or Q). 

There are two conditional probabilities of principal interest. These 
are the conditional probability of the observer saying "SN" given the 
occurrence of a signal, Pog (A) and the probability of saying "SN" given 
the occurrence of noise alone, Py (42۰ These two are sufficient since the 
other two are merely their complements: Poy?) == Poy (A) and Py CB) 


1 - يف دا‎ The conditional and joint probabilities are related as 


follows: 


BO) _ GN * A) (5.1) 


(25252) 


where p(SN) is the a priori probability of Signal occurrence and p(N) - 
l - p(SN) is the a priori probability of occurrence of noise alone. 
With reference to figure 5.2, note that Poy (A) is the integral of foy) 
over the critical region A and P (A) is the integral of £4, O9 over A. 

In statistical decision theory, and in the theory of signal detecta- 


bility, the optimal criterion for a decision is specified in terms of the 
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Figure 5.2. The relationship between the four possible 


outcomes of a simple detection decision, 
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likelihood ratio. That is to say, it can be shown that the optimal cri- 
terion can always be specified by some value 0 of A(x). In other words, 
eril iCal region A that corresponds to the criterion contains all 
observations with likelihood ratio greater than or equal to 8, and none 
Se with likelihood ratio less than 9. To see that this is true, 
note that the expected value of a decision (denoted EV) is defined in 

in statistical decision theory as the sum, over the potential outcomes 
of a decision, of the products of the probability of outcome and desira- 
bility of outcome, Thus, using the notation V for positive ee ed 
values of the observer and K for costs on negative individual values, 


we have the following equation: 
EV - VSN. A p(SN*A) + Vrep P(N*B) — Kon p(SN*B) - N p(N*A) (323) 


If a priori and conditional probabilities from equations 5.1 and 5.2 
are substituted for the joint probabilites in equation 5.3, then collec- 
ting terms yields the results that maximizing EV is equivalent to maxi- 


mizing 
Be) LN (5.4) 


where 0 = 


(5.35) 





It can be shown that this value of 8 is equal to the value of the like- 
lihood ratio that corresponds to the optimal criterion. From equation 
5.4 it is seen that the value 0 simply weights the hits and false alarms 
and equation 5.5 shows that 0 is determined by the a priori probabili- 


ties of occurrence of signal and of noise alone and by the values asso- 
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ciated with the individual decision outcomes. Referring to figure 

5.2 and keeping in mind the result that © is a critical value of A(x), 
it should be clear that the optimal cutoff X. along the x axis is at the 
point on the axis where the ratio of the ordinate value of Foy) to the 
ordinate value of £ (x) is a certain number, n^mely 0. 

The value of 0 above specifies the optimal weighting of hits rela- 
tive to false alarms; x Should always be located at the point on the x- 
axis corresponding to ©. Just where this value of X will be with re- 
ference to the x-axis depends not only upon the overlap of the two den- 


sity functions (i.e., upon the signal strength). 


F. RECEIVER OPERATING CHARACTERISTICS 

Whatever criterion the observer actually uses, even if it is not an 
optimal criterion, it can be described by a single number — some value 
of likelihood ratio. By observing how the observer's performance is 
evaluated with respect to the location of his criterion, it can be seen 
how his performam e may be evaluated with respect to his sensory capa- 
bilities. | 

As previously noted, the fundamental quantities in the evaluation of 
the operators performance are P (4) and Pgy (42° Figure 5.3 shows a 
graph of Poy (A) versus P (A); each arc representing a different amount 
of overlap. Each curve in figure 5.3 corresponds to a given separation 


between the means of the density functions £ (x) and f, (x). The para- 


SN 
meter of these curves is labeled d', where d' is defined as the differ- 
ence between the means of the two density functions expressed in terms 
of their standard deviation, that is: 
په"‎ = "x 
Alle tee (5.6) 
ON) 


91 





| FW i o 


Te-che decision Criterion is set at the left of figure 5.2, we will 
obtain a point in the upper right-hand corner of figure 5.3; both Poy 6A) 
and Py A) are unity. If the criterion is set at the right end of the 
decision axis in figure 5.2, a point at the other extreme of figure 5.3 
results; both Poy (A) and pu CA are equal to zero. Between these two 
extremes lies the criterion of practical interest. The form of the 
curves in figure 5.3 is not the only one which might result, but it is 
the form which will result if the observer chooses a criterion in terms 
of likelihood ratio, and the probability density functions are or 
and of equal variance. 

In the context of the detection problem, figure 5.3 is usually re- 
ferred to as the receiver-operating-characteristic, or ROC curve. The 
optimal "operating level" is the point on a curve where its slope is 
equal to 0 (see equation 5.5). That is, the expression Pox (A) - 6p (A) 
defines a utility line of slope , and the point of tangency of this 
line to the ROC curve is the optimal operating level. Thus the theory 
specifies the appropriate "hit" eier and "false alarm" probability 
for any definition of optimum and any Set of parameters characterizing 
the detection situation. 

Since the separation between the means of the two density functions 
is a function of signal amplitude, d' is an index of the detectability 
of a given signal for a given observer. The variable d' is a measure 
of the observer's sensory capabilities or the effective signal strength. 
The criterion that is employed by the observer, which determines the 
Py 6A) and Poy (A) for some fixed d', reflects the effect of variables 
which have been variously called the set, attitude, or motives of the 


observer. 
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Figure 5.3. Typical ROC curves 
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G. SEQUENTIAL OBSERVATIONS 


If additional observations are allowed before a decision is required, 
then the observer is assumed to adopt two criteria, or three critical 
regions. These regions, at least those pertaining to the first observa- 
tion, are represented as A, B, and C in figure 5.4. If the first obser- 
vation falls in A the observer says "SN", if it falls in B he says "N", 
and if it falls in C he says "Continue". The theory of sequential analy- 
sis assumes that the successive observations are integrated and that the 
integrated value is compared with the three critical regions at each 
observation stage. Thus, a decision is made whenever the combined evidence 
for one or the other of the two hypotheses is sufficiently persuasive. 

It is clear that the relationship depicted in figure 5.4 holds only for 
the first observation in a sequence. A second observation is required 
only if the first falls in the intermediate region, C. The combination 
of the first and second observations, therefore, will not be normally 
distributed , for it results from the addition of a normal variable and 
a normal variable with both tails missing. Analysis of the inter-rela- 
tionships of the parameters is extremely difficult in the sequential 


observation problem because of this. 


H. USE OF IDEAL DESCRIPTIONS AS MODELS 

At first glance it might appear that a theory of ideal behavior is 
really not very useful when talking about a human observer. This is not 
the case, however; ideal behavior constitutes a convenient base from 
which to explore the complex behavior of a real organism. Ideal con- 
ditions generally involve few variables. Having identified the per- 


formance to be expected under ideal conditions, it is possible to extend 
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Figure 5.4. Representation of the sequential decision 


problem. 
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the model to include additional variables associated with the real 
world. In studies using human observers in the signal detection pro- 
blem, it has been found that the ideal theory closely approximates 
human behavior. The problem is one of altering the ideal model in 
some way so that it is slightly less than ideal. 

The human observer is able to adjust his detection criterion to a 
particular problem if the information is presented to him in the correct 
manner. That is, the leaming capabilities of the human observer can 
provide a great deal of flexibility to a signal detection system if he 
is used correctly. The value of this flexibility will be beneficial 
however, only if the values of the observer are such that they complement 
the system. The attitude and motives of the observer play a very impor- 
tant role in the effectiveness of the system. Recent advances in digital 
processing tel ie and computer graphics show great promise in con- 
trolling these parameters through increased processor gain and a man- 


machine interface never before possible with analog equipment. 
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VI. CONCLUSIONS AND RECOMMENDATIONS 


This paper has discussed digital signal processing and spectral an- 
alysis techniques as applied to the detection of discrete narrowband sign- 
als in a nonstationary noise background. Specifically, the discussion 
was slanted toward application of a Fourier processor with integration 
to the Navy's ASW effort. There is a real need today for improved proces- 
sing techniques and it is apparent chat digital processors are the Most 
likely means to improvement. The fast Fourier transform algorithm has 
led to extensive revision in the methods of frequency domain analysis 
and real-time digital processors to the point that bandwidths that are 
sufficient for most problems are a reality today. 

It appears that the first attempt at using interactive graphics tec- 
hniques in the display of a passive ASW signal processor is being made in 
the "Spotlight" processor. Preliminary use of the display indicates that 
the operator now has a flexibility and an insight into the detection 
problem never before possible with previous displays. This particular 
combination of processing and display has tremendous possibilities in the 
field of ASW today. 

There are two questions fundamental to the ASW detection problen, 
which the author feels are of primary importance in determining the per- 
formance capabilities of a Fourier signal processor such as the "Spot- 
light" processor. The first is the question of the effects on the 
spectral estimates of integrating for relatively long periods of time 
over information which is known to be from a non-stationary background. 
The second is the human factors question of what is the optimal amount 


of redundancy required in the display for a processor such as this. 
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Answers, or at least partial answers, to these questions would aid 
tremendously in the actual implementation of this type of equipment. 
Research in these areas is highly recommended since it is only after 
these questions are answered that the full potential of such a system 


can be evaluated. 
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